Robustification of the k-means clustering problem and tailored decomposition methods: when more conservative means more accurate
نویسندگان
چکیده
Abstract k -means clustering is a classic method of unsupervised learning with the aim partitioning given number measurements into clusters. In many modern applications, however, this approach suffers from unstructured measurement errors because result then represents erroneous instead retrieving true underlying structure. We resolve issue by applying techniques robust optimization to hedge against in observed data. To end, we derive strictly and $$\Gamma $$ ? -robust counterparts problem. Since nominal problem already NP-hard, global approaches are often not feasible practice. As remedy, develop tailored alternating direction methods decomposing search space as well robustified problems quickly obtain points good quality. Our numerical results reveal an interesting feature: less conservative -approach clearly outperformed method. particular, able recover clusterings original data even if only observed.
منابع مشابه
More work on K -Means Clustering Algorithm: The Dimensionality Problem
The K-means clustering algorithm is an old algorithm that has been intensely researched owing to its simplicity of implementation. However, there have also been criticisms on its performance, in particular, for demanding the value of K a priori. It is evident from previous researches that providing the number of clusters a priori does not in any way assist in the production of good quality clus...
متن کاملMore Work on K -Means Clustering Algorithm:
The K-means clustering algorithm is an old algorithm that has been intensely researched owing to its simplicity of implementation. However, there have also been criticisms on its performance, in particular, for demanding the value of K a priori. It is evident from previous researches that providing the number of clusters a priori does not in any way assist in the production of good quality clus...
متن کاملCell Growth: When Less Means More
When is less more? A new study reveals that decreased mitochondrial gene expression and reduced lipid biosynthesis may actually increase cell growth.
متن کاملMore GC Means More RNA
Background: Tuberculosis treatment failure and death rates are low in the Western Pacific Region, including Vietnam. However, failure or death may also occur among patients who did not complete treatment, i.e. reported as default or transfer-out. We aimed to assess the proportion failures and deaths among new smear-positive pulmonary tuberculosis patients with reported default or transfer-out.
متن کاملPersistent K-Means: Stable Data Clustering Algorithm Based on K-Means Algorithm
Identifying clusters or clustering is an important aspect of data analysis. It is the task of grouping a set of objects in such a way those objects in the same group/cluster are more similar in some sense or another. It is a main task of exploratory data mining, and a common technique for statistical data analysis This paper proposed an improved version of K-Means algorithm, namely Persistent K...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Annals of Operations Research
سال: 2022
ISSN: ['1572-9338', '0254-5330']
DOI: https://doi.org/10.1007/s10479-022-04818-w